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IMPROVEMENTS IN OR RELATING TO ORGANIC COMPOUNDS 

The present invention relates inter alia, to polynucleotide sequences and their use in 
methods of providing herbicide resistance in plants. In particular the polynucleotides and 
5 methods may be used to confer resistance to herbicides comprising a chemical selected from 
the group consisting of: fomesafen; acifluorfen; chlorimuron ethyl and acetochlor. 

Plants which are substantially "tolerant" to a herbicide when they are subjected to it 
provide a dose/response curve which is shifted to the right when compared with that 
provided by similarly subjected non tolerant like plants. Such dose/response curves have 

10 "dose" plotted on the x-axis and "percentage kill", "herbicidal effect" etc. plotted on the y- 
axis. Tolerant plants will typically require at least twice as much herbicide as non tolerant 
like plants in order to produce a given herbicidal effect Plants which are substantially 
"resistant" to the herbicide exhibit few, if any, necrotic, lytic, chlorotic or other lesions when 
subjected to the herbicide at concentrations and rates which are typically employed by the 

15 agrochemical community to kill weeds in the field. Hereinafter the words (i) "tolerant" and 
(ii) "resistant" when used individually mean "tolerant and/or resistant". 

Herbicide resistant plants are already available within the art for example, 
ROUNDUP READY™ Soya which is resistant to herbicides having as a site of action the 
enzyme 5-enolpyruvylshikimate-3-phosphate synthase, such as those agrochemicals 

20 containing glyphosate. One of the advantages of these plants is that the farmer can apply the 
herbicides to fields containing the resistant crop plants and weeds using "over-the-top 
application", to kill the weeds. 

Other examples of herbicide resistant plants and products and methods for their 
production are shown in International Patent Application Publication Number WO 93/01294 

25 and W099/14337. Here the resistance is achieved by inserting into the plant a 

polynucleotide which provides for the production of a glutathione-S-transferase (GST) 
enzyme which is involved with the detoxification of the herbicide. Glutathione-S-transferase 
enzymes have been shown to exist in various organisms such as bacteria, fungi, yeast, plants, 
mammals and fish and may exist as homo or heterodimers with subunits typically between 

30 24 and 30 KDa. It has been shown that herbicide detoxification is achieved by the 

conjugation of the herbicide with the free thiol glutathione (GSH), a tripeptide (gamma- 



PPD50449 



glutamyl-cysteinyl-glycine) within the plant (Cole D.J. 1994 Pesticide Science. 42 pp209- 
222). Such conjugation is catalysed by GST. Detoxification of herbicides has also been 
shown to occur following the conjugation of the herbicide with homoglutathione, which is 
the predominant thiol in some leguminous species. Homoglutathione (hGSH) is also a 
tripeptide (gamma-glutamyl-cysteinyl-Beta-alanine) but differs from GSH by the addition of 
Beta-alanine instead of a glycine to the gamma-glutamyl-cysteinyl part. 

Thus, the present invention seeks to provide inter alia, novel polynucleotides which 
encode proteins which can be used in methods of providing herbicide resistant plants. 

According to the present invention there is provided a protein comprising the amino 
acid sequence depicted as SEQ ID No. 1 or a protein variant having at least about 70% 
identity therewith wherein the said protein or variant is capable of catalysing the addition of 
Beta-alanine onto gamma glutamylcysteine. In particular the protein 'variant may be 
achieved through conservative substitutions within the amino acid sequence of the active 
protein which substitutions do not significantly adversely affect the activity of the variant 
protein. For example substitutions may be made between the following amino acid groups 
viz. 

(a) Alanine, Serine, Glycine and Threonine 

(b) Glutamic acid and Aspartic acid 

(c) Arginine and Lysine 

(d) Isoleucine, Leucine, Valine and Methionine 

(e) Phenylalanine, Tyrosine and Tryptophan 

It is particularly preferred that the protein variant has a Km for Beta-alanine which is less 
than the said protein variants Km for glycine when calculated using the same method. 
Preferably, the Km for Beta-alanine is less than or equal to about lOmM, more preferably 
5mM, more preferably 4mM, more preferably 3mM, more preferably 2mM, more preferably 
ImM and more preferably 0.8mM. The comparison of the Km with respect to Beta-alanine 
and glycine of the protein variants according to the present invention is calculated using the 
same method. 

It is further preferred that the said protein variant comprises a region having the 
amino acid sequence depicted as SEQ ID No. 2 (KKIQQELAKP) and/or SEQ ID No. 3 
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(CFAGLWSL) and/or SEQ ID No. 4 (VMKPQREGGGNNIYG) and/or SEQ ID No. 5 
(AAYILMQRIFP). The present invention still further provides a polynucleotide encoding 
the protein or protein variant referred to above and it is particularly preferred that the said 

polynucleotide comprises the sequence depicted as SEQ ID No. 6. Proteins and protein 

5 variants falling within this group may be referred to as homoglutathione synthetases. 

In a further aspect of the present invention there is provided a Glutathione-S- 
transferase comprising the amino acid sequence depicted as SEQ ID No. 7 or 8 or 9 or 10 and 
a polynucleotide comprising a region which encodes at least one of the said amino acid 
sequences, preferably the sequence depicted as SEQ ID No. 1 1 or 12 or 13 or 14. The 

10 present invention also provides a polynucleotide sequence which is the complement of one 
which will hybridise to the polynucleotide of the preceding sentence under stringent 
conditions and which polynucleotide sequence still encodes a Glutathione-S-transferase. An 
example of such stringent conditions is hybridisation conducted at 65°C in a solution 
containing 6xSSC, 0.01% SDS and 0.25% skimmed milk powder, followed by washing at 65 

15 °C in a solution containing 0.2xSSC and 0.1% SDS. 

In a further aspect of the present invention there is provided a jpoly nucleotide 
sequence which is the complement of one which binds to SEQ ID No. 1 1 or 12 or 13 or 14 at 
a temperature of between 60°C and 65°C in 0.3 strength citrate buffered saline containing 
0.1% SDS followed by rinsing at the same temperature with 0.3 strength citrate buffered 

20 saline containing 0.1% SDS wherein said polynucleotide sequence still encodes a functional 
Glutathione-S-transferase enzyme with the proviso that the said polynucleotide sequence or 
the amino acid sequence which it encodes, is not a sequence selected from the group of 
sequences listed under accession number; P321 10, U20809, Q03663, P321 1 1, P46421, 
AJ000923, AF004358, AC000348 or AF051238. In particular the polynucleotide sequence 

25 of the preceding sentence is not a sequence selected from the group depicted as SEQ ID Nos. 
25 to 34. 

The present invention further provides a DNA construct comprising in sequence a 
plant operable promoter operably linked to a polynucleotide or polynucleotide sequence 
encoding a GST according to the present invention operably linked to a terminator region. 
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The present invention still further provides a DNA construct comprising in sequence 
a plant operable promoter operably linked to a polynucleotide or polynucleotide sequence 
encoding a protein or protein variant referred to above operably linked to a terminator region. 

ihe present invention still further provides a DIM A construct comprising a first reaion 
comprising in sequence a plant operable promoter operably linJked to a polynucleotide 
encoding a protein or protein variant referred to above arid a terminator region and a second 
region comprising in sequence a plant operable promoter operably linked to a polynucleotide 
or polynucleotide sequence encoding a GST according to the present invention and a 
terminator region. It is particularly preferred that the said first region contains a 
polynucleotide encoding the amino acid sequence depicted as SEQ ID No. 1 and the said 
second region contains a polynucleotide encoding the amino acid sequence depicted as SEQ 
ID No. 10. It is also preferred that the DNA construct referred to above further comprises a 
third region which provides for the production of a protein which acts as a selectable marker 
such as those that provide for antibiotic resistance e.g. kanamycin resistance or those 
providing for herbicide resistance e.g. glufosinate resistance. 

It is preferred that the plant operable promoter within the DNA construct according to the 
present invention is constitutive, tissue specific, inducible or developmentally regulated. 
Such promoters, which are per se not germane to the invention, are well known to the skilled 
person and include, for example such as the constitutive CaMV35S, FMV35S, NOS, OCS, 
Patatin and E9 (derived from the small subunit of RUBISCO) or inducible such as the 
alcA/alcR gene switch described in published International Patent Application No. 
W093/21334; the GST promoter switch described in published International Patent 
Application Nos. WO90/08826 and WO93/01294 and the RMS switch system described in 
published International Patent Application No. WO90/08830. Also the promoter may be 
developmentally regulated and/or tissue specific promoters for example the oleosin, ribulose 
bisphosphate carboxylase-oxygenase small sub-unit promoters. Terminators which can be 
used in the present invention include; Nos and the terminator of a gene of alpha-tubulin (EP- 
A 652,286). 

The DNA construct of the present invention may further comprise (a) transcriptional 
enhancing elements; and/or (b) regions encoding non translated translational enhancing 
sequences, preferably Omega and Omega prime; and/or (c) regions encoding non translated 
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sequences such as intron sequences; and/or (d) regions encoding target sequences which are 
capable of directing transcription products to either intracellular organelles, intracellular 
compartments, cell membranes or to the outside of the cell. 

In a further aspect ofThe present invention there is provided a method of providing 
5 plants which are resistant and/or tolerant to an agrochemical comprising (a) inserting into the 
genome of plant a polynucleotide or a polynucleotide sequence and/or a DNA construct 
which contains a GST according to the present invention; and (b) regenerating 
morphologically normal fertile plants or plant parts therefrom; and (c) applying to said plants 
or plant parts an amount of said agrochemical which is phytotoxic to control like plants and 

10 selecting those plants or plant parts which are resistant to said agrochemical. 

In a further aspect of the present invention there is provided a 'method of providing 
plants which are resistant and/or tolerant to an agrochemical comprising (a) inserting into the 
genome of a plant which plant provides for the production of a functional Glutathione-S- 
transferase, a polynucleotide or a DNA construct comprising a protein or protein variant 

15 according to the present invention; and (b) regenerating morphologically normal fertile 
plants or plant parts therefrom; and (c) applying to said plants or plant parts an amount of 
said agrochemical which is phytotoxic to control like plants and selecting those plants or 
plant parts which are resistant to said agrochemical. 

In a further aspect of the present invention there is provided a herbicide resistant 

20 plants obtainable by the method of the preceding paragraph. 

In a further aspect of the present invention there is provided a method of providing 
plants which are resistant and/or tolerant to an agrochemical comprising (a) inserting into the 
genome of a plant a DNA construct comprising a first region comprising in sequence a plant 
operable promoter operably linked to a polynucleotide encoding a protein or protein variant 

25 referred to above and a terminator region and a second region comprising in sequence a plant 
operable promoter operably linked to a polynucleotide or polynucleotide sequence encoding 
a GST according to the present invention and a terminator region. It is particularly preferred 
that the said first region comprises a polynucleotide encoding the amino acid sequence 
depicted as SEQ ID No. 1 and the said second region comprises a polynucleotide encoding 

30 the amino acid sequence depicted as SEQ ID No. 10; and (b) regenerating morphologically 
normal fertile plants or plant parts therefrom; and (c) applying to said plants or plant parts an 
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amount of said agrochemical which is phytotoxic to control like plants and selecting those 
plants or plant parts which are resistant to said agrochemical. 

In a further aspect of the present invention there is provided a herbicide resistant 
plants obtainecfby the method ot the preceding paragraph. 
5 It is particularly preferred that the agrochemical used in the methods described above 

comprises acifluorfen and/or chlorimuron-ethyl and/or fomesafen and/or acetochlor and/or 
metolachlor. 

The polynucleotides and polynucleotide sequences and DNA constructs described 
above may be inserted into the genome of the selected plants using standard transformation 
10 techniques including particle mediated biolistic transformation, Agrobacterium-mediated 
transformation, protoplast transformation (optionally in the presence of polyethylene 
glycols); sonication of plant tissues, cells or protoplasts in a medium comprising the 
polynucleotide or vector; micro-insertion of the polynucleotide or vector into totipotent plant 
material (optionally employing the known silicon carbide n whiskers'\technique) ? 
15 electroporation and the like. 

In a further aspect of the present invention there is provided plants or plant parts 
obtained according to the methods described above. It is particularly preferred that the plants 
or plant parts are selected from the group consisting of: melons, mangoes, soybean, cotton, 
tobacco, sugarbeet, oilseed rape, canola, flax, sunflower, potato, tomato, alfalfa, lettuce, 
20 maize, wheat, sorghum, rye, bananas, barley, oat, turf grass, forage grass, sugar cane, pea, 
field bean, rice, pine, poplar, apple, peaches, grape, strawberries, carrot, lettuce, cabbage, 
onion, citrus or nut plants. 

In a further aspect of the present invention there is provided the use of a 
polynucleotide or a polynucleotide sequence according or a DNA construct according to the 
25 present invention in a method of producing plants which are resistant and/or tolerant to a 
herbicide comprising acifluorfen and/or chlorimuron-ethyl and/or fomesafen and/or 
acetochlor and/or metolachlor. 

In a further aspect of the present invention there is provided a method of selectively 
controlling weeds in a field said field comprising crop plants and weeds said method 
30 comprising applying to said field an agriculturally acceptable formulation of an agrochemical 
comprising acifluorfen and/or chlorimuron-ethyl and/or fomesafen and/or acetochlor and/or 
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metolachlor or a functionally related analogue thereof, characterised in that the said crop 
plants are the plants obtained according to the methods described above. 

In a further aspect of the present invention there is provided the use of an 

agrochemical comprising acifluorfen and/or chlorimuron-ethyl and/or fomesafen and/or 

5 acetochlor and/or metolachlor or a functional analogue thereof to selectively control weeds in 
a field which field comprises crop plants and weeds comprising applying to said field an 
agriculturally acceptable formulation of said agrochemical in an amount which is sufficient 
to be phytotoxic to said weeds but not said crop plants characterised in that said crop plants 
are the plants obtained according to the methods described above. 

10 In a further aspect of the present invention there is provided a protein comprising the 

sequence depicted as SEQ ID No. 7 or a protein variant having a Smith-Waterman score 
greater than 1094 in the SWISSPROT database calculated using the FASTA3 algorithm 
wherein the said protein variant still encodes a glutathione-S -transferase. 

In a further aspect of the present invention there is provided a protein comprising the 

15 sequence depicted as SEQ ID No. 8 or a protein variant having a Smith- Waterman score 
greater than 619 in the SWISSPROT database calculated using the FASTA3 algorithm 
wherein the said protein variant still encodes a Glutathione-S-transferase. 

In a further aspect of the present invention there is provided a protein comprising the 
sequence depicted as SEQ ID No. 9 or a protein having a Smith-Waterman score greater than 

20 671 in the SWISSPROT database calculated using the FASTA3 algorithm wherein the said 
protein variant still encodes a Glutathione-S-transferase. 

In a further aspect of the present invention there is provided a protein comprising the 
sequence depicted as SEQ ID No. 10 or a protein having a Smith- Waterman score greater 
than 766 in the SWISSPROT database calculated using the FASTA3 algorithm wherein the 

25 said protein variant still encodes a Glutathione-S-transferase. 

The Fasta algorithm referred to above is well known to the skilled artisan and uses 
the method of Pearson and Lipman (Proc. Natl. Acad. Sci. USA 85; 2444-2448 (1988)) to 
search for similarities between one sequence and any group of sequences of the same type as 
the query sequence. Fasta also determines the best segment of similarity between the query 

30 sequence and the sequences in the database , using a variation of the Smith- Waterman 
algorithm. This "local alignment" procedure is described in Chao, Pearson, and Miller 
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(CABIOS 8; 481-487 (1992)). The score for this alignment is reported as the opt and Smith- 
Waterman score. The database used in the above calculations is the SWISSPROT database. 
This database, which is well known and frequently used by the person skilled in the art is 
commercially available from sources such as Geneva biointormatics (GeneBio llM ) S.A 25 
avenue de Champel CH - 1206 Geneva Switzerland. 

The present invention will now be described by way of the following non limiting 
examples with reference to the figures and sequence listing of which: 

Figure 1. Vector pMJB2. 
Figure 2. Vector pMOG800. 
Figure 3. Vector pMOG1051. 

Sequence Listing 

SEQ ID No. 1. Homoglutathione synthetase from Glycine max. 
SEQ ID Nos. 2 to 5. Homoglutathione synthetase protein fragments. 

SEQ ID No. 6. Polynucleotide sequence encoding Homoglutathione synthetase from Glycine 
max. 

SEQ ID Nos. 7 to 10. Glutathione-S-transferases 2.6, 3.1, 3.2 and 3.3 respectively. 

SEQ ID No. 1 1 to 14. Polynucleotides encoding the GSTs 2.6, 3.1, 3.2 and 3.3 respectively. 

SEQ ID Nos. 1 5 to 24. Primers. 

SEQ ID No. 25 Soybean sequence P321 10 derived nucleic acid sequence. 
SEQ ID No. 26 Mungbean sequence U20809. 
SEQ ID No. 27 Tobacco sequence Q03663. 

SEQ ID No. 28 Potato sequence P321 1 1 derived nucleic acid sequence. 

SEQ ID No. 29 Arabidopsis sequence P46421. 

SEQ ID No. 30 Arabidopsis sequence P46421 (genomic). 

SEQ ID No. 31 Papaya sequence AJ000923. 

SEQ ID No. 32 Spruce sequence AF051214. 

SEQ ID No. 33 Wheat sequence AF004358. 

SEQ ID No. 34 Spruce sequence AF051238. 
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Example 

Production of plants with increased herbicide tolerance through increased in planta 
expression of (a) homoglutathione (b) homoglutathione and glutathione S-transferase (GST). 



1.1 Isolation and preparation of polynucleotide sequence encoding hGSH synthetase. 
Degenerate oligonucleotide primer MS-3 and OG9 are used in conjunction with PCR to 
amplify polynucleotide sequences encoding partial length Homoglutathione (hGSH) 
synthetase from first strand cDNA, synthesised using total RNA extracted from soybean cell 
cultures (cv. Mandarin) (Skipsey et aL 1997) 

SEQ ID No. 15 

MS-3 5' GCG AAG CCH CAR MGA GAR GGH GGA GG-3' 

SEQ ID No. 16 

OG-9 5 ' CGC ACT GAG AGA GGA TCC TCG AG 3 5 

The resulting PCR products are cloned into the vector pGEM-T (Promega™), which is 
termed MS3-1. The cloned insert is then excised from MS3-1 using Eco Rl, labelled with 
digoxigenin using standard protocols (Boehringer Mannheim™), and used to screen a 
soybean cell culture cDNA library (cv. Mandarin), constructed in the uni-ZAP XR vector 
(Stratagene™). 

1.2 Isolation and preparation of polynucleotide sequences encoding glutathione S-transferase 
Degenerate oligonucleotide primers CJACON2 and CJACON3 are used independently in 
conjunction with oligonucleotide primer OG9 to amplify DNA probes from first strand 
cDNA suitable for cDNA library screening. 



SEQ ID No. 17. 

CJACON2 5 5 TTC TGG GYK RAS TWC ITY GAC RAI AAG 3' 
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SEQ ID No. IS. 

CJACON3 5 ' GAG TC Y M WK GTS ATT GTT GAA TAC ATT GAT GAG 3 5 



The PCR products obtained are cloned into the vector pCR 2 A using the original TA clonin 
kit (Invitrogen™). Distinct cDNAs are identified by automated DNA sequencing (ABI 377) 
and labelled with digoxigenin in order to facilitate the screening of cDNA libraries for 
cDNAs encoding full length enzymes. 

1.3 Preparation of constructs for use in heterologous gene expression in E. coli 
Full length cDNAs encoding homoglutathione synthetase and glutathione ^-transferase are 
independently expressed in E.coli using the pET expression system (Novagen™). Nco 1 and 
Xho 1 restriction enzyme sites are introduced by at the 5 prime and 3 prime ends of the 
homoglutahione synthetase cDNA respectively using primers MS4-Nco and MS-4-HIS. 

SEQIDNo. 19 

MS4-Nco 5' GCT CTC AAA CCC ATG GCT CAA CC 3' 
SEQ ID No. 20 

MS-4-His 5 ' GCG CTC GAG AGT TAG GTA TAC AGT ATC TAC C 3 ' 

The PCR fragment is then digested with Ncol and Xho 1 and ligated into similarly digested 
pET-24d. This vector is termed pET-MS4-His. 

With respect to the cDNAs encoding glutathione S-transferase, either Nde I or Nco I sites are 
introduced as appropriate at the 5' end, and Bam HI at the 3' end using PCR as described 
previously. The cDNA is then introduced into pET-24a or pET-24d as appropriate. 

1.4 Expression and characterisation of recombinant enzymes in E coli 
Glutathione S-transferase 

The vectors described in section 1.3, harbouring cDNAs encoding GSTs, are transformed 
into E. coli BL21 (DE3) using standard bacterial transformation procedures. Expression and 
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purification of the recombinant GST is performed according to the methods described by 
Cummins et al. y 1997. 



Homoglutathione synthetase 
5 Purification of homoglutathione synthetase is performed using the following method. E. colt 
BL21 (DE3) harbouring the pET-MS4-His plasmid (section 1.3) are grown at 30 °C until 
OD 600 =0.5, after which isopropyl-P-D-thiogalactoside (IPTG) is added to a final 
concentration of 0.1 mM. Following a 3 hour incubation the bacteria are collected by 
centrifugation, re-suspended in buffer A (20mM Tris, 0.5 M NaCl, 5 mM imidazole pH 8.0) 

10 and lysed. Cell debris is removed by centrifugation (10,000 g, 10 min) and the supernatent 
applied to a 5 ml iminidiaceic acid column (Sigma™), previously charged with NiS0 4 and 
equilibrated in buffer A. The column is then washed buffer A containing 20 mM imidiazole, 
followed by buffer A containing 300 mM imidiazole to remove affinity bound protein. The 
protein in this fraction is then concentrated using a Centriplus 30 (Amicon™) spin column 

15 and re-suspended in buffer A prior to application onto a 1 ml HiTrap chelating column 
(Phamacia™), pre-charged with NiS0 4 and equilibrated as described previously. Affinity 
bound protein is recovered using an increasing concentration of 20-200 mM imidiazole and 
the presence of His-tagged recombinant homoglutathione synthetase detected using His- 
tagged antibodies according to standard procedures. Fractions containing recombinant 

20 homoglutathione synthetase are pooled, concentrated using Centricon 30 spin columns 
(Amicon™) and resuspended in 20 mM Tris-HCl pH 8.0, ImM DTT. 

1.5 Assay of enzyme activity of recombinant GST and homoglutathione synthetase . 
GST assays are performed as described by Andrews et al., 1997. 

25 

Homoglutathione synthetase is assayed for activity in 250 mM Tris-HCl pH 8.0, 50 mM 
KC1, 20 mM MgC12, 5 mM DTT, 10 mM ATP, 1 mM y-glutamylcysteine and 50 mM 
glycine or 10 mM P-alanine in a total 100 \xL Experimental controls consist of the exclusion 
of enzyme or glycine/p-alanine. Assays are performed at 30°C for 60 min, with 20 jal 
30 aliquots removed at regular time intervals. Monobromobimane derivatisation is then 



PPD50449 



- 12 - 



performed on the aliquot to determine the presence of either glutathione or homoglutathione 
according to methods described by Cummins et a!., 1997. 



T.6 Production ol plant transformation vector harbouring cDNA encoding homoglutathione 
5 synthetase . 

Oligonucleotide primers hGSH-Ncol and hGSH-Kpnl are used in conjunction with PCR to 
introduce Nco\ and Kpn\ sites at the 5 prime and 3 prime end of the cDNA respectively. 

SEQ ID No. 21 

10 Hgsh-Ncol 5' CCT CTC AAA CCC ATG GCT CAA CC 3' 
SEQ ID No. 22 

hGSH-Kpn 1 5' CGC GGT ACC TCC ATA CAA AGA AAA TCA 3' 

15 The cDNA is then inserted into the vector pMJB2 (Figure 1) as a NcoMKpnl fragment. The 
expression cassette, containing the CaMV35S double enhancer : Glucanase II leader, hGSH 
synthetase cDNA and nos terminator is the excised from pMJB2 using Hind III/ EcoRl and 
ligated into the similarly digested binary vector pMOG800 (Figure 2). 

20 1 .7 Production of plant transformation vector harbouring tandem cDNAs encoding 
homoglutathione synthetase and GST . 

Oligonucleotide primers 3.6-Bgl II and 3.6 Nco 1 are used in conjunction with PCR to 
introduce Nco\ and Bgl II sites at the 5 prime and 3 prime end of the GST 3.6 cDNA 
respectively. 

25 

SEQ. ID No. 23 

3.6-Bgl II 5' GAG ATC TGC AAC AAA CAT AGC CTC 3' 



30 



SEQ ID No. 24 

3.6 Nco 1 5' TAC ACC ATG GCT GAA AGG GAC TTG 3' 
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The GST cDNA is then inserted into the vector pMOG1051 (Figure 3) as a NcoUBgl II 
fragment. The expression cassette, containing the RolD/Fd promoter , GST cDNA and potaio 
PI-II terminator is the excised from pMOG1051 using Bam HI and ligated into the unique 
Bam HI site in the binary vector containing the hGSH expression cassette (section 1.6). 
Orientation of the insert is determined by PCR. 

1.8 Plant Transformation and Regeneration . 

Constructs from section 1.6 and 1.7 are transformed into Agrobacterium tumefaciens strain 
LBA 4404 using the freeze thaw method of transformation provided by Holsters et al., 1 97S. 
Tobacco transformation and whole plant regeneration is performed using var. Samusun 
according to protocols detailed by Bevan, 1984. Transformation events are selected on MS- 
media containing kanamycin. 

1.9 Analysis of Transgenic Plants 
PCR analysis of transformants 

Leaf samples were taken from transformed lines and DNA extracted according to the 
methods described by Edwards et aL, 1991. Oligonucleotide primers are designed to specific 
regions within the transgene to enable it's detection in the plant material tested. 

RNA analysis 

The presence of mRNA encoding the transgene is detected within the plant using Northern 
Blot hybridisation. Total RNA is extracted from leaf tissue using Tri-reagent and protocols 
provided by the manufacturer (Sigma™). Blots are prepared using existing procedures 
(Sambrook, 1989) and probed using radio-labelled cDNA encoding the desired mRNA. 

Protein Analysis 

The presence of recombinant protein in the transgenic plant is determined using Western 
blotting procedures with antibodies raised to the relevant enzyme using standard protocols. 
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Herbicide tolerance tests 

Following tissue culture, transgenic plants are transferred to 5 inch pots containing John 
Innes potting compost no, 3. The plants are allowed to acclimatise (approx. 2 weeks) and 
various herbicides applied at various concentrations to the aerial tissue using a track sprayer. 
Visual assessment of phytotoxicity/plant necrosis is performed 6 and 13 days post 
application. Plants show resistance to herbicides applied at concentrations which are 
phytotoxic to control like plants. 

1.10 Production of homozveous plant lines 

Single copy transgenic plant lines are identified by Southern blot analysis according to 
methods described by Sambrook, 1989 using appropriate radiolabeled probes. Segregation 
analysis is performed on plants containing single insertion events by germination on MS 
media containing kanamycin. Further confirmation of homozygous lines is performed by- 
back crossing transgenic lines with wild-type tobacco and analysis genetic segregation 
following selection on kanamycin. 
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SEQUENCE LISTING 

<110> ZENECA LIMITED 

5 <120> IMPROVEMENTS IN OR RELATING TO ORGANIC COMPOUNDS 

<130> ZENECA CASE PPD50449 

<140> 
10 <141> 

<160> 34 

<170> Patentln Ver. 2.0 

15 

<210> 1 

<211> 499 

<212> PRT 

<213> Glycine max 

20 

<400> 1 

Met Ser Gin Pro Leu Thr Thr Asn Ser Val Leu Val Glu Glu Ala Ala 
15 10 15 

25 Ala Asp Gly Asp Ser Ser Ala Ala Ala Pro Pro Leu Phe Asp Tyr His 

20 25 30 

Arg lie Asp Gin Lys Leu Leu Gin Asn lie Val Tyr Asp Ala Leu Val 
35 40 45 

30 

Trp Ser Thr Leu Asn Cys Leu Leu Val Gly Asp Lys Ser Val Gin Arg 
50 55 60 

Ser Gly Arg Val Pro Gly Val Gly Leu Val His Leu Pro Leu Ser Leu 
35 65 70 75 80 

Leu Pro Gly Pro Phe Pro Glu Ser His Trp Lys Gin Gly Cys Glu Leu 
85 90 95 

40 Ala Pro lie Phe Asn Glu Leu Val Asp Arg Val Ser Leu Asp Gly Lys 
100 105 110 

Phe Leu Gin Glu Ser Leu Ser Arg Thr Lys Asn Ala Asp Glu Phe Thr 
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115 120 125 

Ser Arg Leu Leu Asp He His Ser Lys Met Leu Gin He Asn* Lys Lys 
130 135 , 1^0 



—5 

Glu Asp He Arg Met Gly He Val Arg Ser Asp Tyr Met lie Asp Glu 
145 150 155 160 

Lys Thr Lys Ser Leu Leu Gin lie Glu Met Asn Thr lie Ser Thr Ser 
10 165 170 175 

Phe Ala Leu lie Gly Cys Leu Met Thr Gly Leu His Lys Ser Leu Leu 
180 185 190 

15 Ser Gin Tyr Gly Lys Phe Leu Gly Leu Asn Ser Asn Arg Val Pro Ala 
195 200 205 

Asn Asn Ala Val Asp Gin Ser Ala Glu Ala Leu Ala Lys Ala Trp Ser 
210 215 220 

20 

Glu Tyr Asn Asn Pro Arg Ala Ala He Leu Val Val Val Gin Val Glu 
225 230 235 240 

Glu Arg Asn Met Tyr Glu Gin His Tyr lie Ser Ala Leu Leu Arg Glu 
25 245 250 255 

Lys His His lie Arg Ser lie Arg Lys Thr Leu Thr Glu lie Asp Gin 
260 265 270 

30 Glu Gly Lys lie Leu Pro Asp Gly Thr Leu Ser Val Asp Gly Gin Ala 
275 280 285 

He Ser Val Val Tyr Phe Arg Ala Gly Tyr Thr Pro Lys Asp Tyr Pro 
290 295 300 

35 

Ser Glu Ser Glu Trp Arg Ala Arg Leu Leu Met Glu Gin Ser Ser Ala 
305 310 315 320 

lie Lys Cys Pro Thr lie Ser Tyr His Leu Val Gly Thr Lys Lys He 
40 325 330 335 

Gin Gin Glu Leu Ala Lys Pro Gly Val Leu Glu Arg Phe Val Glu Asn 
340 345 350 
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Lys Asp His lie Ala Lys Leu Arg Ala Cys Phe Ala Gly Leu Trp Ser 
355 360 365 

— 5 Leu Glu A s p Ser ft sp— H^-^/crtr^y s Lys Ala lie G lu Asn Fro GTu Leu 

370 375 380 

Phe Val Met Lys Pro Gin Arg Glu Gly Gly Gly Asn Asn Ile'Tyr Gly 
385 390 395 400 

10 

Asp Glu Leu Arg Glu Thr Leu Leu Lys Leu Gin Glu Ala Gly Ser Gin 
405 410 415 

Glu Asp Ala Ala Tyr lie Leu Met Gin Arg lie Phe Pro Ala Thr Ser 
15 420 425 430 

Pro Ala lie Leu Val Arg Asp Gly Asn Trp Asp Thr Gly His Val lie 
435 440 445 

20 Ser Glu Ala Gly lie Phe Gly Thr Tyr Leu Arg Asn Lys Asp Lys lie 
450 455 460 

lie lie Asn Asn Glu Ser Gly Tyr Met Val Arg Thr Lys lie Ser Ser 
465 470 475 480 

25 

Ser Tyr Glu Gly Gly Val Leu Pro Gly Phe Gly Val Val Asp Thr Val 
485 490 495 

Tyr Leu Thr 

30 



<210> 2 
<211> 10 
35 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : Protein 
40 Fragment 



<400> 2 

Lys Lys lie Gin Gin Glu Leu Ala Lys Pro 



PPD50449 

m - i9 



10 



<210> 3 

~~5 <2TT> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

10 <223> Description of Artificial Sequence : Protein 
Fragment 

<400> 3 

Cys Phe Ala Gly Leu Trp Ser Leu 
15 1 5 



<210> 4 
<211> 15 
20 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : Protein 
25 Fragment 

<400> 4 

Val Met Lys Pro Gin Arg Glu Gly Gly Gly Asn Asn lie Tyr Gly 
15 10 15 

30 

<210> 5 
<211> 11 
<212> PRT 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : Protein 
Fragment 



40 



<400> 5 

Ala Ala Tyr lie Leu Met Gin Arg lie Phe Pro 
15 10 
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<210> 6 
<211> 1854 

~5 <212> DNA 

<213> Glycine max 



<400> 6 

ggcggcttgg tttgttctac ttcctcttac 
10 tctaacaaca ccacctttcc cttcccccaa 
ctcaaactca tgtctcaacc tttgaccacc 
gatggtgatt cctccgccgc cgcacctccc 
ctgctccaaa acatagttta cgatgctctt 
ggtgacaaat ctgttcagag atcaggaaga 
15 ctttccttat tacctgggcc atttcctgaa 
cctatattta atgaacttgt tgatcgggtg 
ctctccagaa ctaagaatgc ggatgaattt 
atgctacaga ttaacaaaaa agaggacata 
attgatgaga agactaaatc acttttacaa 
20 gctttgattg gttgtcttat gactggactt 
ttccttggac taaattccaa tagggttcct 
gccttggcta aagcttggag tgagtataac 
caggttgaag aaagaaacat gtacgagcag 
catcatatta gaagcatacg caaaacgttg 
25 ccagatggaa cactttctgt ggatggacaa 
tacacgccaa aggactatcc ttcagaatca 
tcttctgcta tcaaatgccc tacaatatct 
caggaacttg caaagcctgg tgttcttgag 
aaattgcgtg catgctttgc agggttgtgg 
30 gcaattgaaa atccagagct atttgtgatg 
atttatggtg atgagttgag ggaaaccctc 
gatgcagcat acatccttat gcagaggata 
cgtgatggta attgggatac gggtcatgtc 
ttaaggaata aggacaagat tatcattaat 
35 atatcatcat cttatgaagg aggagttttg 
ctaacttgat ggagctaacc ccccaagtta 
tatatatcac cactcaagtc tcctcactcc 
tcttttaatt gttcctatgg gatggtgtct 
attctaacca ataaagattg atgaactgtt 
40 

<210> 7 
<211> 222 
<212> PRT 



actgggatta gaacaaggcg tttgtgcact 60 
caacctcaat cactctcttt cgctaaacct 120 
aactctgttc ttgttgaaga ggctgctgct 180 
ctcttcgatt atcatcgtat cgaccaaaaa 240 
gtctggagca ccctcaactg cctccttgtt 300 
gttcctggtg tgggcctggt acatctccca 360 
agtcattgga agcaagggtg cgaattagct 4 20 
agtttggatg ggaaatttct ccaggaatct 480 
acctcaagac ttttagatat tcattctaag 540 
cgcatgggaa tagttcgttc agattatatg 600 
atagagatga acactatttc cacttcattt 660 
cataagagct tactttctca atatggaaaa 720 
gccaataatg ccgttgatca gtctgcagag 780 
aatcccaggg ctgcaattct ggtcgtggtt 840 
cattatattt ctgcacttct aagagaaaag 900 
accgaaattg atcaggaagg aaaaattctg 960 
gcaatttcag ttgtttactt ccgggctggc 1020 
gaatggagag ctaggctact gatggaacaa 1080 
tatcatttgg ttggcaccaa aaagattcaa 1140 
aggttcgttg aaaacaaaga ccacattgcc 1200 
agtttggaag actcagatat tgttaaaaaa 1260 
aagcctcaaa gagaaggagg aggaaacaat 1320 
cttaaattac aggaagcagg ttctcaagaa 1380 
tttcccgcca cttctccagc aattttggtg 1440 
atttcagaag ctggaatatt tggtacttat 1500 
aacgaaagtg gctatatggt gcgtacaaaa 1560 
cctggttttg gagtggtaga tactgtatac 1620 
tcaaagcaat tcaaaacatt atgtatggtt 1680 
tgattttctt tgtatggagg cattgctgtt 1740 
aattattaac tgtactcaac gacctgtttg 1800 
ctaacaaaaa aaaaaaaaaa aaaa 1854 
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<213> Glycine max 
<400> 7 

Met Ser Ser Ser Gin Glu Glu Val Thr Leu Leu Gly Val Val Giy Ser 

— i 5 i-e r5 



Pro Phe Leu His Arg Val Gin lie Ala Leu Lys Leu Lys Gly Val Glu 
20 25 30 

10 Tyr Lys Tyr Leu Glu Asp Asp Leu Asn Asn Lys Ser Asp Leu Leu Leu 
35 40 45 

Lys Tyr Asn Pro Val Tyr Lys Met lie Pro Val Leu Val His Asn Glu 
50 55 60 

15 

Lys Pro lie Ser Glu Ser Leu Val lie Val Glu Tyr lie Asp As? Thr 
65 70 75 80 

Trp Lys Asn Asn Pro lie Leu Pro Ser Asp Pro Tyr Gin Arg Ala Leu 
20 85 90 95 

Ala Arg Phe Trp Ala Lys Phe lie Asp Asp Lys Cys Val Val Pro Ala 
100 105 110' 

25 Trp Lys Ser Ala- Phe Met Thr Asp Glu Lys Glu Lys Glu Lys Ala Lys 
115 120 125 

Glu Glu Leu Phe Glu Ala Leu Ser Phe Leu Glu Asn Glu Leu Lys Gly 
130 135 140 

30 

Lys Phe Phe Gly Gly Glu Glu Phe Gly Phe Val Asp lie Ala Ala Val 
145 150 155 160 

Leu lie Pro lie lie Gin Glu lie Ala Gly Leu Gin Leu Phe Thr Ser 
35 165 170 175 

Glu Lys Phe Pro Lys Leu Ser Lys Trp Ser Gin Asp Phe His Asn His 
180 185 190 

40 Pro Val Val Asn Glu Val Met Pro Pro Lys Asp Gin Leu Phe* Ala Tyr 
195 200 205 



Phe Lys Ala Arg Ala Gin Ser Phe Val Ala Lys Arg Lys Asn 
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210 215 220 



<210> 8 
"5 <2il> Z35~ 



♦ 



<212> PRT 

<213> Glycine max 

<400> 8 

10 Met Ala Glu Gin Asp Lys Val lie Leu His Gly Met Trp Ala- Ser Pro 
15 10 15 

Tyr Ala Lys Arg Val Glu Leu Ala Leu Asn Phe Lys Gly lie Pro Tyr 
20 25 30 

15 

Glu Tyr Val Glu Glu Asp Leu Arg Asn Lys Ser Asp Leu Leu Leu Lys 
35 40 45 

Tyr Asn Pro Val His Lys Lys Val Pro Val Leu Val His Asn Gly Lys 
20 50 55 60 

Ala lie Ala Glu Ser Met Val lie Leu Glu Tyr lie Asp Glu Thr Trp 
65 70 75 80 

25 Lys Asp Gly Pro Lys Leu Leu Pro Ser Asp Ser Tyr Lys Arg Ala Gin 

85 90 95 

Ala Arg Phe Trp Cys His Phe lie Gin Asp Gin Leu Met Glu Ser Thr 
100 105 110 

30 

Phe Leu Val Val Lys Thr Asp Gly Glu Ala Gin Gin Lys Ala lie Asp 
115 120 125 

His Val Tyr Glu Lys Leu Lys Val Leu Glu Asp Gly Met Lys Thr Tyr 
35 130 135 140 

Leu Gly Glu Gly Asn Ala lie lie Ser Gly Val Glu Asn Asn Phe Gly 
145 150 155 160 

40 lie Leu Asp lie Val Phe Cys Ala Leu Tyr Gly Ala Tyr Lys Ala His 

165 170 175 



Glu Glu Val lie Gly Leu Lys Phe lie Val Pro Glu Lys Phe Pro Val 
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180 



1S5 



190 



Leu Phe Ser Trp Leu Met Ala lie Ala Glu Val Glu Ala Val Lys lie 

195 .200 205 

Ala Thr Pro Pro His Glu Lys Thr Val Gly lie Leu Gin Leu Phe Arg 

210 215 220 



Leu Ser Ala Leu Lys Ser Ser Ser Ala Thr Glu 
10 225 230 235 



<210> 9 
<211> 223 
15 <212> PRT 

<213> Glycine max 

<400> 9 

Met Ala Glu Val Lys Leu His Gly Phe Trp Tyr Ser Pro Tyr Thr Leu 
20 1 5 10 15 

Arg Val Val Trp Thr Leu Lys Leu Lys Asp lie Pro Tyr Gin Asn lie 
20 25 30 

25 Glu Glu Asp Arg Tyr Asn Lys Ser Leu Gin Leu Leu Glu Tyr Asn Pro 
35 40 45 

Val Tyr Lys Lys Thr Pro Val Leu Val His Asn Gly Lys Pro Leu Cys 
50 55 60 

30 

Glu Ser Met Leu lie Val Glu Tyr lie Asp Glu lie Trp Ala His Asn 
65 70 75 80 

Ser Leu Leu Pro Ala Asp Pro Tyr Glu Arg Ala Leu Ala Arg Phe Trp 
35 85 90 95 

Val Lys Tyr Ala Asp Asp Asp Met Phe Ser Ala Val lie Ala Phe Phe 
100 105 110 

40 Leu Ser Asn Asn Asp Glu Glu Arg Glu Lys Ser lie Glu Lys lie Trp 
115 120 125 

Glu His Leu Arg Val Val Glu Asn Gin Cys Phe Gly Asp Gin Lys Lys 
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130 135 140 

Phe Phe Gly Gly Asp lie lie Asn He Met Asp He Ala Phe'Gly Ser 
145 150 155 160 



He Phe Lys He Leu Val Val Ala Glu Asp He Leu Asp Ala Lys Val 
165 170 175 

Leu Glu Asp Glu Lys Phe Pro His Leu His Ser Trp Tyr Asn Asn Phe 
10 180 185 190 

Lys Asp Val Ala Val He Lys Glu Asn Leu Pro Asp His Glu Lys Met 
195 200 205 

15 Val Ala Phe Ala Lys Phe He Arg Glu Lys Arg Leu Ala Cys Thr 
210 215 220 



<210> 10 
20 <211> 232 
<212> PRT 
<213> Glycine max 

<400> 10 

25 Met Ala Glu Arg Asp Leu Arg Leu Leu Gly Ala Trp Phe Ser Pro Phe 
15 10 15 

Ala Leu Arg Val Gin lie Ala Leu Asn Leu Lys Gly Leu Asp Tyr Glu 
20 25 30 

30 

Val Val Glu Glu Thr Leu Asn Pro Lys Ser Glu Leu Leu Leu Lys Ser 
35 40 45 

Asn Pro Val His Lys Lys He Pro Val Phe Phe His Gly Asp Lys Val 
35 50 55 60 

He Cys Glu Ser Ala He He Val Glu Tyr He Asp Glu Val'Trp Ser 
65 70 75 80 

40 Asn Asn Ala Leu Ser He Leu Pro Gin Asn Ala Tyr Asp Arg Ala Asn 

85 90 95 



Ala Arg Phe Trp Val Ser Tyr He Asp Asp Lys Trp Leu Thr Ser Leu 
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100 105 no 

Lys Ser Val Leu Ala Thr Glu Asp Asp Glu Ala Lys Lys Leu His Phe 
115 120 125 

5 ™ — 

Glu Gin Ala Glu Glu Val Leu Glu Lys Val Glu Glu Val Phe Asn Lys 
130 135 140 

Cys Ser Glu Gly Lys Ala Tyr Phe Gly Gly Asp Thr lie Gly Phe Val 
10 145 150 155 160 

Asp lie Gly Phe Gly Ser Phe Leu Ser Phe lie Arg Val Ser Glu Asn 
165 170 175 

15 Met Asn Glu Arg Lys Leu Leu Asp Glu Thr Lys Tyr Pro Gly Leu Thr 
180 185 190 

Leu Trp Ala Glu Thr Phe Ala Ala Asp Pro Ala Val Lys Gly Leu Leu 
195 200 205 

20 

Pro Glu Thr Glu Lys Leu Val Glu Phe Ala Lys lie Leu Gin Leu Lys 
210 215 220 

Trp Ala Ala Ala Ala Ala Ala Lys 
25 225 230 



<210> 11 
<211> 885 
30 <212> DNA 

<213> Glycine max 

<400> 11 

ctgcaatgtc ttcaagtcag gaagaggtga cccttttggg agttgtggga agcccatttc 60 
35 tacacagggt tcagattgct ctcaagttga agggagttga atacaaatat ttggaagacg 120 
atttgaacaa caagagtgat ttgctcctca agtataaccc agtttacaaa atgattccag 180 
tgcttgttca caatgagaag cccatttcag agtcccttgt gattgttgag tacattgatg 240 
acacatggaa aaacaatccc atcttgcctt ctgatcccta ccaaagagcc ttggctcgtt 300 
tctgggctaa gttcattgat gacaagtgtg tggttccagc atggaaatct gcttttatga 360 
40 ctgatgagaa agagaaagag aaggctaaag aagagttatt tgaggctctg agttttcttg 420 
agaatgagtt gaagggcaag ttttttggtg gagaggagtt tggctttgtg gatattgctg 480 
ctgtgttaat acctataatt caagagatag cagggttgca attgttcaca agtgagaaat 540 
tcccaaagct ctctaaatgg agccaagact ttcacaacca tccagttgtc aacgaagtta 600 
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tgcctcctaa ggatcaactt tttgcctatt tcaaggctcg ggctcaaagc ttcgttgcta 660 
aaagaaagaa ttaatatagt gagactcaga atttccatcg aggtttcagt attgtatgaa 720 
atgaaagcta cttgtctatg tttcgttatt gccgttgtat tttcattttt caatgaatta 780 
tgtgatatag gatttctcca tgtcaaaaga tagttcaatt caatcaataa aataaacgaa 840 



tgagtcgtgt tagagcaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 885 

<210> 12 

<211> 899 

<212> DNA 

<213> Glycine max 

<400> 12 

atggcagagc aagacaaggt gatcctacac gggatgtggg ccagccctta tgccaagagg 60 
gtggaattgg cccttaattt taagggcata ccctatgagt atgttgaaga agacctgaga 120 
aataagagtg atttgcttct aaagtacaac cctgttcaca agaaggttcc tgtacttgtt 180 
cataatggaa aggccattgc tgaatccatg gtgatccttg agtatattga tgaaacatgg 240 
aaagatggtc ctaaactgct tccaagtgat tcttacaaac gagcccaagc tcgattctgg 300 
tgtcatttca tccaggatca gttaatggag agcacttttc tagtagtcaa aactgatgga 360 
gaagcacaac aaaaggccat tgaccacgtg tatgagaaac tgaaagtgct agaagatgga 420 
atgaagacct atctgggaga aggcaatgct attatctctg gtgttgaaaa caactttgga 480 
atccttgaca ttgtgttttg tgctttatat ggtgcctaca aggctcatga agaagttatt 540 
ggcctcaagt tcatagtgcc agaaaagttt cctgtgttgt tttcttggtt gatggctatt 600 
gctgaggttg aagctgtgaa aattgcaact cctccacatg aaaaaacagt gggaattctt 660 
cagttgttca ggctgtctgc actgaaa.tct tcttctgcca cagaatgata tatacttcaa 720 
cactttaata gactgtccat cgtttgcttc ttctgcgagt ctttaatgta tgtatctttc 780 
aataacagga tgagtaacac ctgagtatgt aaagcgtgat gatatagaga tatacctcta 840 
tatatcaaat actcttctat aaacacttct ttctttcctt aaaaaaaaaa aaaaaaaaa 899 



<210> 13 

<211> 840 

<212> DNA 

<213> Glycine max 



<400> 13 

atggcagagg tgaagcttca tggattttgg 
accttaaagt taaaggatat accatatcaa 
cttcaacttc ttgaatacaa cccagtatac 
aaacccttat gtgagtccat gcttattgtt 
tcattacttc ctgctgatcc ctacgagaga 
gatgatgaca tgttttctgc agttattgca 
gaaaagagca tagagaagat atgggagcat 
gatcagaaga aattttttgg gggagacatt 
atattcaaaa ttcttgtggt tgcagaagat 



tatagtccct acactttgag ggtggtatgg 60 
aacatagaag aagaccgcta caataagagt 120 
aagaaaactc cagtgcttgt ccataatgga 180 
gaatacattg atgagatttg ggcacataat 240 
gctctggcaa ggttttgggt taaatatgct 300 
ttcttcctta gcaataatga tgaagagcga 360 
ctcagggttg ttgagaatca gtgttttggt 420 
attaacatta tggacatagc ttttgggtcc 480 
attcttgacg cgaaggtcct ggaagatgag 540 
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aaattccctc acttgcattc atggtataat aatttcaagg atgttgcagt tattaaagaa 600 

aacctcccag accatgagaa aatggtcgct tttgctaagt ttattagaga aaaacgtttg 660 

gcatgtacct aagaaagtaa tcttatatga gatcaagtat gaatcacttt gtatctgtct 720 

gaatcgtttt gttatgcgtg tttcttzagt ttccactcca ttattaggat gtcttgacat 780 

3 atctgtgaaa gcaataaaag tttaatcgga tgtactggat taaaaaaaaa aaaaaaaaaa 840 



<210> 14 
<211> 918 
<212> DNA 
10 <213> Glycine max 

<40O> 14 

cataaaactc cacatttcct gctgagtaac ctaacaaaac aaacacaata ttgctccgtg 60 

tttgacctgt tatagtaaac agtgatcgct gaaagggact tgaggctttt gggtgcttcg 120 

15 ttcagtccat ttgccctgag ggtgcagatt gcccttaacc tcaagggttt ggattatgag 180 

gttgttgaag agactttgaa tcccaaaagt gaattgcttc ttaagtccaa ccctgtgcac 240 

aagaaaatcc cagttttctt ccatggagat aaagtcatat gtgaatctgc aatcatagtt 300 

gagtacatag atgaggtttg gtccaacaat gctctctcca tccttccaca aaatgcatat 360 

gatcgagcta atgcccgatt ttgggtttct tacatcgatg acaagtggct tacgtccttg 420 

20 aaaagtgttc tagcgactga agatgatgag gcaaagaagc tacactttga gcaagcggaa 480 

gaagtgcttg agaaggtgga agaagtgttc aacaagtgca gtgaagggaa ggcctatttc 540 

ggaggagata cgattggatt tgttgacatt ggttttggaa gctttttgag tttcattaga 600 

gtctcagaga atatgaatga aagaaaattg cttgatgaaa cgaagtaccc tggtttgacc 660 

ctatgggctg aaacttttgc tgctgatcct gctgtgaagg gccttctgcc agagactgaa 720 

25 aagcttgttg agtttgcaaa gattcttcag ctaaaatggg ctgctgcagc tgctgcaaac 780 

taaatggaat caaattaatt gctggatgaa tttcaaaaat tgttgtgcaa gttatttata 840 

tctgaggcta tgtttgttgc aactttatat atttaaaagt caaaataaat gttatgataa 900 

tatagtaaaa aaaaaaaa 918 



30 <210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



35 <220> 

<223> Description of Artificial Sequence : PRIMER 



40 



<400> 15 

gcgaagcchc armgagargg hggagg 2 6 

<210> 16 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 

-5 " " _ " " " ~~ " ^ " 

<400> 16 

cgcactgaga gaggatcctc gag 

<210> 17 
10 <211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence : PRIMER 
<400> 17 

ttctgggykr astwcityga craiaag 

20 <210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
25 <220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 18 

gagtcymwkg tsattgttga atacattgat gag 

30 

<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

35 

<220> 

<223> Description of Artificial Sequence : PRIMER 

<400> 19 
40 cctctcaaac ccatggctca acc 



<210> 20 
<211> 31 
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<212> DNA 

<213> Artificial Sequence 
<220> 

"~ 5 <223> Description of Artificial Sequence : PRIMER " 
<400> 20 

gcgctcgaga gttaggtata cagtatctac c 31 

10 <210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
15 <220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 21 

cctctcaaac ccatggctca acc 23 

20 

<210> 22 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence : PRIMER 



<210> 23 
<211> 24 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 
40 <400> 23 

gagatctgca acaaacatag cctc 24 



<400> 22 



30 



cgcggtacct ccatacaaag aaaatca 



27 



<210> 24 
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# 



<211> 24 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 24 

tacaccatgg ctgaaaggga cttg 24 

<210> 25 
<211> 2763 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : SOYBEAN Derived 
nucleic acid sequence P32110 

20 <400> 25 

ccatacaaac aatacataac taaataactg taattttatg ttttctttca ttgtattgtg 60 
cgttttatat taatgtatta attatgttgt tttttaattt ttattgataa attatttaaa 120 
ctttaaataa aaaacattta aagtaaacta taataaaaat atatataaca tgttttaaat 180 
aattaaacat taaaataaaa taataacata tcaaataaaa tcacaaaaaa tatactaatg 240 

25 aaatcaactt aatattattt attatatatt taaatatata ttttttaact tttttatttt 300 
aaaaaaataa aattgttaac attataaaat taaaaaacaa atttatgaaa aaaaaggtta 360 
acaaattaaa aacaaactac aaatagtaaa ataacgtttt taaaaaaaaa aggaggttac 420 
gatctcaaaa tcgtatacct catgaaattt ttaaaaaata agtaagacag ttttacaact 480 
ttaaagtaat aaaacaaaat acgagttttt ttttaatacg actttaaagt cgtaaaaaaa 540 

30 ttatatttac attgaagttg taaataatat ttttacgatt tcagttaaaa aaataagaaa 600 
actaaaaagt cgtaataggt gctaacaact gttacaactt aacccgtttt agataatttt 660 
ttaaaatata ccccaaatga aaaattgcgg attattttat ccccaattgg taaaaaagca 720 
cctgaatact agaaacaagg gtggcaaaga ttcaaagtaa tgaatttata aaaaaaaaaa 780 
gaaaatattt atataattac acaaaatctt aaaaaatgtt gttattattt actactaatt 840 

35 tataattcga atcgagtata tttaaggcaa agttctctac gcggtttacg taatctctta 900 
catcatttat ttttccacgt ttgtttactc tgaggcatct tcttgatagg ggaaagtttt 960 
ttcatatttt ttctagatcc ttctcatgtg caatgtctcc gcattcatac gcagcaatca 1020 
aaatggaatt aatacacggt agcactttcc catttttttg ttatcgtccc cactattgac 1080 
tatacccatg atatgtatat aggtatatct tttctaattg attacgtatc tgctgaatac 1140 

40 tagctagtcc tgattcctag ctctataaaa ggagaatacc ataggaattc atcacagaca 1200 
aacaaacaat ttaccagcta tacttgttcc ttttgaaggt tagaagtgct acaatacaaa 1260 
caatggcagc tactcaggaa gatgtgaagc ttttgggtat tgtgggaagc ccatttgtgt 1320 
gcagggtcca gattgccctt aagttgaagg gagttgaata caaatttttg gaagaaaatt 1380 



10 



15 
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tgggcaacaa gagtgatttg cttctcaaat acaaccctct tcacaagaag gttccagtgt 1440 
ttgttcacaa tgagcagccc atagcagagt czcttgtgat tgttgaatac attgatgaga 1500 
catggaagaa caaccccatc ttaccttctg atccttacca aagagccttg gctcgtttct 1560 
ggtccaaatt cattgatgat aaggtaactc aacatttcaa aaatcttctt agttttcatg 1620 

— 5 atttgtgctg attt gtcagc aaaacatcac gatgaaatct ataratgtga aatctttctg rSBXT 

gtgtggaata tatatgtgaa atctttgaat atgttagaga actcaaaagt caacagccaa 1740 
ccatgatttt tttttaatgt atcaactttt tg~aaaacaa tatragtgat ttgaaacttt 1800 
atggaatcat atactatgat tttgggagaa atcttatttt attttactat ttttatctgg 1860 
ggtgggggga ttctgtcacg tatttgtttc tacataactc gatctaaatt ctgtttgtcc 1920 

10 taatcacttt atgaaataat tactaataaa tartgtgatt tgcgaaatca gattgtgggt 1980 
gctgtatcga aatctgtttt cacggttgat gagaaagagc gtgagaagaa tgttgaagaa 2040 
acatatgagg ctcttcagtt tcttgagaat gagctgaagg acaagaagtt ttttggagga 2100 
gaggaatttg ggttggtaga tattgctgct grcttcatag cattttggat cccaattttt 2160 
caggaaatag cagggttgca gttattcacc actgagaaat ttcciatact ctacaaatgg 2220 

15 agccaagaat tccttaacca cccttttgtg cacgaagtcc ttcctcctag agacccactt 2280 
tttgcctact tcaaagcccg ctatgaaagt ctttctgctt caaaatagac ttatttaagg 2340 
atatttgttg aacaacttgt gtcttgttga gttattgctg tttgaatttc atgtaaaatg 2400 
atactagcta tatgtaaatc ccagaaaaaa aaaaaaaaga atcctaggat cttgttttcg 2460 
ttttggccat ttcagtatat aaagaaatta tatttttcga tataaatttt gttgtgaaaa 2520 

20 gctttattct tccttcataa aatccttcaa tgtgcataat cttattcgta gagagactta 2580 
gagcggctag tagctactac cttgaaattt ttuctttaat tcgaaggaca acgtatatat 2640 
tatataataa taattattgc aagttggaaa tcgtgtaagc atgtttcatg actatatgag 2700 
ttaacaatat actgtcttgc ctctgcaacc ttcatggatt ctaaaattat tcccttggct 2760 
gca 2763 

25 

<210> 26 
<211> 1137 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> Description of Artificial Sequence : Mungbean 
Sequence U20809 

35 <400> 26 

gaattaaagt gcttcgatgg gttcaagtca ggaagaagtg acccttctgg aattattgga 60 

agcccatttg tttgcagggt gaagatagcc ctgaagttga agggagttga atacaaatac 120 

gttgaagaaa atttccgcaa caagagtgaa cagcttctga aatacaaccc agttcacaag 180 

aaggttccag tgtttgttca tggtgacaaa ccccttccag agtcccttgt gattgttgag 240 

40 tacatcgatg agacatggaa caacaacccc atcttggctt ctgatcctta ccagagagcc 300 

ttggctcgtt tctggtccaa attcatcgat gacaagattg tgggtgcttc gtggaaatct 360 

gttttcacag ttgatgagaa agagcgtgag aagaatattg cagaaacata tgagagtctg 420 

cagtttcttg agaatgagat aaaggagaag aagttctttg gaggagaaga gcttgggttg 480 



PPD50449 



-32- 



gtagatattg ctgctgtcta tgtagcattt tggatccctt tgattcaaga aatagcagga 540 
ttggagttat tgacaagtga gaaatttcct aatctctaca ggnggagcca agaatttttg 600 
aaccatccaa ttgtcaaaga aagtcttccc cctagagacc cagtttttgc ctttttcaaa 660 
ggacgctatg aaggcctttt ttcttcgaaa tagatttcat gttgtgagag atttagaati 720 
~~5 tataaggaaa attgtgtgga gtacttagtt aggatttggt ttcaaaatta tggttgaagt 780 
tgaatcctag gatttgcgca tgtcaaacaa ataacctggg attgttcgtg ttgatatttt 840 
actatttcaa tcaataaatt atgcagcttc ttaccgagtt aacattcgat cgaaataagg 300 
accaacaaga ttaagtaagg ctgcattatt tgtctttttg ttaaattaga tattagtatg 360 
caccaaaaag tgagtatttc cttacagaag ctttttaaat attaagtagt taattccata 1020 
10 ggtctaccat tatagctcaa gttatataca tattatgggt gccattctct actcaacaat 1080 
tatgactata aaatcttgtg gttataatgc cacgaacaag tgaactatct cacttca 1137 



<210> 27 
<211> 2038 
15 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Tobacco 
20 sequence Q03663 



<400> 27 

ctcgaggatt tcaaactcta gcttcactaa 
aaaacgaaat aaacataagc tatttacaaa 

25 gtcaagtcgt gattgggata agaaaataga 
ttggaatgag ataatagaaa agtatgatag 
ggaaggaaat tagagaaaga actgaagaat 
agttatttaa cgtcatcgag atgacggcca 
gaaagaagaa gacacaaact gtgt.ttggta 

30 ttgtacatat aagaaatata atataagatt 
tcaaagtatt tattttaaat tctttttcca 
tataaatatg gaagtaaaaa ttaatcagat 
ttaaccataa ccagtctact ttatttaaca 
ttaacttcat gcattgacaa tttaaaatta 

35 acaattggta actgcatata taagtttaat 
tataacagga tatattacat tgatattacc 
atcaccgtga aatcttcaag atttctccta 
ccattcaaga tacaacattt ctcctatagt 
ttcatttctt cttgtctcta cacttctctt 

40 ccaaaattct caacaagact atttggatgc 
agaacctttg acctgggacg accaggtagc 
ggctgcagat tgtaacctcg tacattctca 
aagtggcgat ttcatgacgg ctgctaaggc 



aacttgagct ttcttttcca ctaatgtcga 60 
aaataaaaaa atactccatt tgaatctaaa 120 
aatttattta tact cca gat caagccgtga ISO 
tacatgagta acatcaagtt ggaaattaag 240 
atccaaatat tctttacgtc caaatttgat 300 
tgttcaagtt ttccacaaat attgagaaaa 360 
ttattatagt tttttctttt agagaattga 420 
tagaaataag attattagaa aaatcaaaca 480 
atggacattc ccattctgaa aaaaaagaga 540 
cgttaaatgt agaaaatatt aattaacaca 600 
aaaagcacat ctgaratarc aaaaaagtgt 660 
ttttgcaaca tcgggtaaaa ctattttaca 720 
atggtaacct agaaaatagg ataaattatc 780 
atgtcaaaaa atttagtaag tacatgaata 840 
taaataccct tggtagtaaa tctagttttt 300 
catgggattt gttctctttt cacaattgcc 960 
attcctagta atatcccact cttgccgtgc 1020 
ccataacaca gctcgtgcag atgtaggtgt 1080 
agcctatgcg caaaattatg cttcccaatt 1140 
tggtcaatac ggcgaaaacc tagctgaggg 1200 
tgttgagatg tgggtcgatg agaaacagta 1260 
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ttatgaccat gactcaaata cttgtgcaca aggacaggtg tgtggacact atactcagct 1320 

ggtttggcgt aactcggttc gtgttggatg tgctagggtt cagtgtaaca atggagaata 1380 

tgttgtctct tgcaactatg atcctccagg taattataga ggcgaaagtc catactaatt 1440 

gaaacgacct acgtccattt cacgttaata tgtatggatt gttctgcttg atatcaagaa 1500 

_5 c 1 1 a a a t a-a^t — t-ge t c ta-aaa ggcaacttaa agtca a gtat atagtaatag tactatattt 1560 

gtaatcctct gaagtggatc :ataaaaaca ccaagtggtc ataattaagg ggaaaaatat 1620 

gagttgatga tcagcttgat gtatgatcug atattattat gaacactttt gtactcatac 1680 

gaatcatgtg ttgatggtct agctacttcc gatattacga gcaaaattct taactacatg 1740 

ccttaggaac aagcttacac agttcatata atctactaga gggccaaaaa catgaaaati 1300 

10 accaatttag atggtaggag gatattgaaa gtggagcagc tagttttaat aactgaccgt 1360 

tagtcttaaa attgacggta -aaaaatatt tacataatca ggtcatttat aaggtaatta 1S20 

taggtaaata tttatgacga attctcaata gtaatctgaa aaaaaattgt aactaaccta 1980 

ttatactaaa actactataa caggttagan tacattaatc atgtcattag aagatctt 2038 



15 <210> 28 

<211> 2796 
<212> DNA 

<213> Artificial Sequence 



20 <220> 

<223> Description of Artificial Sequence : Potato- Derived 
nucleic acid sequence P32111 



<400> 28 

25 aagcttacat tcaacgtgtt gctgcttcaa aataagggtc tttacaaaat ttaaaaaata 60 

taaagaagaa ttttgaattt tataattaaa gccgtcagaa aggacttgac ctttgaagcc 120 

aacccaactc aatattagaa aatcaaaaat tttagtgcat tcatttataa aaaaaaaaaa 180 

aattacttat gcagttcttg aaccctttgt gagacgagag ggagttgctc ggatggtaag 240 

cacccttcac tttcaacccg aaggttgcga gtttgagtca ccaacggagc aaaaagggta 300 

30 ggagctccta gaaagggtaa aaaaaaaaaa aaaaattaat aaaaaaatac cctttatgaa 360 

atttctcatt ccgctactgc acttctcccc tgatcttcct cgtgttttca attattaatt 420 

ctatattcat gacaccatgt gatgtttctc tgggtagtcc taaaaataga ggtattgaaa 480 

attatgttgt ttctctctgg tctatttact tttcttgtgt actttattgt atttcatatt 540 

gttaattttt ggcttcgttt i:ataacatgt tattagcaca aaactttaat catatcgagt 600 

35 taaactttta attttgctta tcaacgtaaa agacaagata tgtgatcggc atgtataact 660 

atgttttaat taggtataat acataaatat ttcctttaat tttatctcat tttatattta 720 

tgtcgtttaa ctttggagcg tacaagtgtc tacttgtgat ccagtaggtt atcaaagctc 780 

gcatgcttag ttttactttc caaattgaaa tttatattag aattgaattc aggaagaatt 840 

ttgtaggttc aactaaatta tatatatata tataaaaaaa taaaaattat tagacgcttc 900 

40 gactatttac ttactttaaa atttgaattt tcgtacgaat aaaattattt gtcagagaaa 960 

agtcttttag ctattcacat gctaggaagt ttcacttttg gtggatcagt gattgtatat 1020 

tatttaatat atatcaattt tctcatcaaa ctgaaaatga aagataaaat taatattaaa 1080 

aactccattc attttaattt attgtcatgt tttgacttga tccaaaatct aacaatttaa 1140 
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aaggttttaa atttttgtgc ttttttttaa attaaaaata tgtcaaatat attaaaatat 1200 

attttttaaa ttttatacta aaaaacatgt cacatgaata tttgaaatta taaaattatc 1260 

aaaaataaaa aaagaatatt tctttaacaa attaaaaatg aaaatatgat aaaiaaatta 1220 

aactattcta tcattgattt ttctagccac cagatttgac caaacagtgg gtgacatgag 1330 

-ea-e ataagtc atcttta t tg tatttt a tto ctcactccaa a a a -t-a-bee-gtg-a- a t at g tt ta. 144-0- 



ctacttaatt tagtcaaata taattttata ttagaataat tgaatagtca aacaagaaac 1500 

tttaatgcat ccttattttt tcctctataa aaaaaagact agacaccaag ggagaccaac 1560 

cacacataat taagatggca gaagtgaagt tgcttggtct aaggtatagt ccttttagcc 1620 

atagagttga atgggctcta aaaattaagg gagtgaaata tgaatttaca gaggaagatt 1630 

10 tacaaaataa gagcccttta cttcttcaat ctaatccaat tcacaagaaa attccagtgt 1740 

taattcacaa tggcaagtgc atttgtgagt ctatggtcat tcttgaatac attgatgagg 1800 

catttgaagg cccttccatt ttgcctaaag acccttatga tcgcgcttta gcacgatttt 1860 

gggctaaata cgtcgaagat aaggtatatt gcttttaagt tatcccaatt gattgaaaag 1S20 

tttgttttag ttacgttatt acatatactt taggtctcat gctttttaat aatcttttat 1930 

15 aaaattcgac taagacgaac ttctcgtata gtcaacaata ctaacatatt tgtctagtag 2040 

ttggttagga aataagttat ccgaatatta aattctggat aagtaatgaa taccatattt 2100 

gatagttgat ttggagataa attattcgtg tataaaatta atatgatatt tgatttgcaa 2160 

tttagaaata cataactatt ttatatgcat agatccatta taactaattg atatattatt 2220 

aatatctgta taactctaac cagctatcga aacgagtcaa cgaaccttat taagttttgt 2280 

20 ttgttgggca gggggcagca gtgtggaaaa gtttcttttc gaaaggagag gaacaagaga 2340 

aagctaaaga ggaagcttat gagatgttga aaattcttga taatgagttc aaggacaaga 24G0 

agtgctttgt tggtgacaaa tttggatttg ctgatattgt tgcaaatggt gcagcacttt 2460 

atttgggaat tcttgaagaa gtatctggaa ttgttttggc aacaagtgaa aaatttccaa 2520 

atttttgtgc ttggagagat gaatattgca cacaaaacga ggaatatttt ccttcaagag 2580 

25 atgaattgct tatccgttac cgagcctaca ttcagcctgt tgatgcttca aaatgagtat 2640 

acctcaagtg aatttcaaga ttttgtgtgg caataaaaat tgagtttttg taaattcaat 2700 

tgaaatatat taaagttgca tgttataaga tttatcttta tttcactagt taatataaat 2760 

tttggattca cgtataaata aaagtattgt taagag 2796 

30 <210> 29 

<211> 1289 
<212> DNA 

<213> Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence : Arabidopsis 
P46421 

<400> 29 

40 aagtcaaggt acacatgctc cagggataag gcaaggttag gaattaggac acatctccac 60 

ttactaaaaa agagataaaa aaaaattgta tagggaacgt tataaatatg ttgtaaagtc 120 

aacatctgtt tccttctaga ctcttcgcat ttacatcaca ctgccgacca tataaaacgg 180 

caaagttcgt cgtcgtttta tcacaagacc atcaacacca taaggctata aatccaagct 240 



-35- 



aaaaggtagt gattaactcc acaaaaccag aaaaactaca tttctaacat atagaagaaa 3C0 

cagagaaaaa gagagagaga cccctaatgg ctgagaaaga agaagtgaag cttttgggga 360 

tatgggcgag cccttttagc cgtcgggtcg acatggctct caaactcaaa <5gcataccgt 420 

acgagtacgt ggaagagata ctggagaaca aaagcccttt gcttcttgct cttaacccta 480 

"3 ttcacaagaa agtccctgtt cttgtccaca atggtaaaac cattctcgag tctcatgtga 5T0 

ttcttgaata catcgatgag acttggccac aaaatccaat tctccctcaa gatccttatg 600 

aaagatccaa agctcgtttc tttgctaaac tcgtcgatga acaggtaatt gaattggttc 660 

aaaattgcat gtcaaataat aaacaattgg ttctgctttg ttaatttatc aaacaagtaa 720 

ttttctatta acattagcga ttatatgtct ctgtcattgt agattatgaa cgtggggttt 780 

10 atatcaatgg caagagcaga cgagaaagga agagaagttt tagccgagca ggtaagagaa 840 

ctgattatgt atctcgagaa agaacttgtc ggaaaagatt acttcggagg caaqactgtc 900 

ggattcttgg actttgtcgc cggaagttta attccgtttt gtttggagag aggttgggaa 960 

ggaataggat tggaagtgat tacagaggag aagtttccag agttcaagag atgggttagg 1020 

aatttggaga aggttgagat tgttaaagat tgtgttccac caagagagga acatgtagaa 1030 

15 cacatgaact atatggcaga gagagtgaga tcttcttaag aaaacaaatc atgtttagtt 1140 

cttgatcatg caatgtttgt atggttatgt tgttgtttat tttattgaat atctttgtat 1200 

gttgtgtggt tgagaagtga ggttttatca tcatctctca cgttatctta tttggtccca 1260 

gccactattt agaattaatg gtaaagctt 1289 



20 <210> 30 

<211> 1339 

<212> DNA 

<213> Artificial Sequence 



25 <220> 

<223> Description of Artificial Sequence : Arabidopsis 
Genomic sequence 



<400> 30 

30 gaattaattc ttgacgaagc atggggttgt cttcagaaaa tgaatggatc aagtcaaggt 60 

acacatgctc cagggataag gcaaggttag gaattaggac acatctccac ttactaaaaa 120 

agagataaaa aaaaattgta tagggaacgt tataaatatg ttgtaaagtc aacatctgtt 180 

tccttctaga ctcttcgcat ttacatcaca ctgccgacca tataaaacgg caaagttcgt 240 

cgtcgtttta tcacaagacc atcaacacca taaggctata aatccaagct aaaaggtagt 300 

35 gattaactcc acaaaaccag aaaaactaca tttctaacat atagaagaaa cagagaaaaa 360 

gagagagaga cccctaatgg ctgagaaaga agaagtgaag cttttgggga tatgggcgag 420 

cccttttagc cgtcgggtcg agatggctct caaactcaaa ggcataccgt acgagtacgt 480 

ggaagagata ctggagaaca aaagcccttt gcttcttgct cttaacccta ttcacaagaa 540 

agtccctgtt cttgtccaca atggtaaaac cattctcgag tctcatgtga ttcttgaata 600 

40 catcgatgag acttggccac aaaatccaat tctccctcaa gatccttatg aaagatccaa 660 

agctcgtttc tttgctaaac tcgtcgatga acaggtaatt gaattggttc aaaattgcat 720 

gtcaaataat aaacaattgg ttctgctttg ttaatttatc aaacaagtaa ttttctatta 780 

acattagcga ttatatgtct ctgtcattgt agattatgaa cgtggggttt atatcaatgg 840 
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caagagcaga cgagaaagga agagaagttt tagccgagca ggtaagagaa ctgattatgt 900 
atctcgagaa agaacttgtc ggaaaagatt acttcggagg caagactgtc ggattcttcc 960 
actttgtcgc cggaagttta attccgtttt gtttggagag aggttgggaa ggaataggat 102C 
tggaagtgat tacagaggag aagtttccag agttcaagag atgggttagg aatttggaga 10SC 
~~ 5 aggttgagat tgttaaagat tgtgttccac caagagagga acatgtagaa cacatgaact 114C 
atatggcaga gagagtgaga tcttcttaag aaaacaaatc atgtttagtt cttgatcatg 12C: 
caatgtttgt atggttatgt tgttgtttat tttattgaat atctttgtat gttgtgtggt 126C 
tgagaagtga ggttttatca tcatctctca cgttatctta tttggtccca gccactattt 132C 
agaattaatg gtaaagctt 133? 

10 

<210> 31 
<211> 968 
<212> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> Description of Artificial Sequence : Papaya AJ000923 



* 



<400> 31 

20 tagaactagt ggatcccccg ggctgcagga 
gctccgtttt acaacaatgg cggacgaggt 
tggaatgaga atcagaatcg ctttagccga 
gaatctgaga aacaagagtc ccttactcct 
ggttctcatc cacaatggta aacccatctg 

25 tgaggtatgg agcgacaagg ctcctctgct 
caggttctgg gctgactatg ttgacaagaa 
gactaaaggg gaagaacagg agggggccaa 
ggagggagaa cttggggaga agccttattt 
gacttttatc ccattctaca cttggttcag 

30 agaggcagaa tgccccaagt tgtttagttg 
ttcaaaatct ctgcctgatc aagataaggt 
tcttgggatt tgagtttttt gagagacttc 
gtcctccaag tattgattaa aaaaggttct 
actgttgtat tggaataaag gggtgtcctt 

35 tgtggtgacg tcaatttctt gtgtgttgta 
tcatatgt 



attcggcacg agagatttta tcttttagga 60 
tgttctcttg gatttctggc caagcccttt 120 
gaagggtatt cactacgagt acaaggaaga 180 
gcagatgaac ccggtacaca agaaaatccc 240 
tgagtctttg atccagattc agtacataga 300 
tccctctgat ccttatcaga gagctcaagc 360 
gatgtatgaa gctgggagga gagtttggac 420 
gaaagagttc atagaaatct tgaagacttt 480 
tggtggggaa agttttgggt atgtggattt 540 
tgtgtatgaa agttttggga agatgagcat 600 
ggtgaaaagg tgtttggaga aggagagtgt 660 
atacggcttc gttttggaac tcaggaaggc 720 
aaaatccttg ttccatttcc attagggttc 780 
ggatcaacta ctttatttgt ctagtctttt 840 
ttgtggatta ggtgagattt ctatcaatat 900 
ggcaaatcat atttgaataa aatctttctt 960 

968 



<210> 32 
<211> 1040 
40 <212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence : Spruce AF051214 
<400> 32 

gaaaagacag atacatagaa agttagagac agttacaata cattttcaga gcaatcgcga 60 
5 tggaggcctg tggagaagag gcgcaagtga agctgttggg tgggaatata agtccctttg 120 
tgctgcgggt ccgtatagca cttgctctta aagggatcga ttacgagttc atcgaagaga 180 
acatgcaaaa taagagccat ctgctgctgc aatcgaaccc tgttaacaac aagattccag 240 
tgctcatcca caatggaaag cctgtttgtg agtccatgat tattgtgcag tacatagacg 300 
aggcatggga cacgaaggcg ccagttctaa tgcccaaaga tccttatgac cgacccattg 360 

10 cccgcttctg ggctgccttt gtagacgata agctgctgcc atgccttcgg ggagttttca 420 
agggccaggg agagcagcaa cagaaagcgt tggaagaatc gggggcaagc tttcttttac 480 
tggaggaggc tctgcgaacg agccactgct tctcgggaaa accgtatttc ggaggagatg 540 
agatcggctt tcttgacatc gcattgggtg gtatgttagc atttgtcaaa gccctcgaga 600 
aggttactaa tttagtttta atagaccagg agaagatgcc gctgttaagc acatggatga 660 

15 atcgattctg tgaggccgat ggagtgaaag acgttatgcc ggatccggcc aagttgcagg 720 
aatttatatc cgccatcaga gtcagattta catcaccacc tgctgccaat taggcgaagc 780 
cattcggcca ttaaatggat gttatcgtcc gcattgtttt tggttttatg ctgtcagttt 840 
gaatgttgtt atgctttttg aattgttggt gtttaatggg aataattcta tcgcccaatc 900 
tagcaccgtg tgattcgcta tcagttctca ccgtgttcca tgtaacttcc atttcatgat 960 
20 tttgggagat agaacaaaaa ttcatggaaa tgtgtgttag tgttttatat ttgaaaaggg 1020 
ttggatttgc agagaatgga 1040 



<210> 33 
<211> 902 
25 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Wheat AF004358- 

30 

<400> 33 

ggcacgaggc aagcacaagc agctaaagca atctcagtgg tcttcaaaac acacacacac 60 

acactgacac acatcgatcg aggtagttag agatggccgg aggagatgac ctgaagctgc 120 

tcggcgcatg gccaagccca tttgttacca gggtgaagct ggcgctcgcc ctgaagggcc 180 

35 tgagctacga ggacgtggag gaggacctgt acaagaagag tgagcttctc ctcaagtcca 240 

acccggtgca caagaagata cccgtgctca tccacaacgg agccccggtc tgcgagtcca 300 

tgatcattct ccagtacatc gacgaagtgt tcgccagcac cggcccgtcc cttcttccag 360 

cggaccccta cgagcgcgcc attgctcgct tctgggtggc ttacgttgac gacaagctgg 420 

tagccccatg gaggcagtgg ttgaggggca agacagagga ggagaaatcc gagggaaaga 480 

40 agcaggcgtt cgccgcggtg ggggtcctcg aaggggccct gagggagtgc tccaagggag 540 

ggggcttctt cggtggcgac ggcgtcgggc tcgtcgacgt tgcgctggga ggcgtgctgt 600 

cgtggatgaa ggtgaccgag gcgctgtctg gtgacaagat tttcgacgcc gccaagactc 660 

cgctcctggc cgcatgggtg gagcgcttca ttgagctcga cgcggccaag gccgccctgc 720 
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cggacgtggg caggctgctt gagtttgcca aggcacgaga ggctgccgct gcagcc:cca 780 
agtgagccgc cagcacatat ccagaataat taaaatttgt tattttaaat gttgtt~gtt 840 
cggctgcttg atgtaataat gtagtaactg atgtcgtcca tttaaaaaaa aaaaaaaaaa S00 



"3 

<210> 34 
<211> 1127 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223> Description of Artificial Sequence : Spruce AF051238 
<400> 34 

15 gacacatata actcacaggc aaaaaaatat tcaattacaa tacattttcc gtgcaarggc 60 
gacggaggct tgtggagaaa aggggcaagt gcagctgtta gg'tgggagtc tgagttcctt 120 
cgtgctgcgg gttcgcatag cacttgctct taaaggcatc gattacgagt tcatcgaaga 180 
gaacatgcaa aataggagcc agctgctgct gcaatcaaac cctgttcaca ggaaga:ucc 240 
agtgcttatc cacaatggaa agcccgtttg tgaatccatg attattgtgc agtacatcga 300 

20 tgaggcatgg gacactaagg cacccaatct tatgcccaaa gatccatatg acctagccat 360 
tgcccgcttc tgggctgcct ttgtagacga taagctcgtt ccatgtatgc ggagagtttt 420 
cgctggccat ggcgagcagc tacagaaaga agcggaagat ctggttacaa actttcattt 430 
gatagaggaa gctctgcgaa ccaacagctg cttctcagga aaagcgtatt ttggaggcga 540 
taagataggc ttgcttgaca tcgcattggg tggtatgttg gcggttctca aaggccccga 600 

25 gaaggctacc gataccgtta taatagatcc ggagaagatg ccgttgctga gcgcatggat 660 
ggaccgattt tgtcaatcca atggagtgaa agaagtaatg cccgatccgg ccaagcagct 720 
ggaatctcta tcagctagga gagccagact tgcatcacct gctggcaatt agggcaagcc 780 
atgtcggcct tataaactga ggatagacag atggattata aacttattat tcgtactact 840 
tgtcctttta ttcatgtggt cagcttcagc gttttaattc ttgctgtttt atgtgaataa 900 

30 gtctgaataa tgtttgggtg aatctcgcct gtactatagc tggcattcac ctgtttattg 960 
tacgctgatt tagtttgaac aagttttggt gaatctcccc tgtactgaag ctggcattcc 1020 
cctgttcaat gtgcgctgat ttagtttgaa taagtttttg atgaatctcg cttgtactgt 1080 
agctatgtgc gatgattttt aatgccatag aaacgagaat gaaatgc 1127 



35 
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CLAIMS 

1 . A protein comprising the amino acid sequence depicted as SEQ ID No. 1 or a protein 
variant having at least about 70% identity therewith wherein the said protein or 
variant is capable of catalysing the addition of Beta-alanine onto gamma 
glutamylcysteine. 

2. A protein variant according to claim 1 having a Km for Beta-alanine which is less 
than the said variants Km for glycine when calculated using the same method. 

3. A protein variant according to claim 2 having a Km for Beta-alanine which is less 
than or equal to about 0.8mM and a Km for glycine which is higher than 0.8mM 
when calculated using the same method. 

4. A protein variant according to claim 2 or 3 which variant comprises an amino acid 
sequence selected from the group depicted as SEQ ID No. 2, 3, 4 or 5. 

5. A polynucleotide comprising a region encoding the protein or protein variant 
according to any one of claims 1 to 4. 

6. A polynucleotide according to claim 5 which comprises the sequence depicted as 
SEQ ID No. 6. 

7. A Glutathione-S-transferase comprising the amino acid sequence depicted as SEQ ID 
No. 7, 8, 9 or 10. 

8. A polynucleotide comprising a region which encodes at least one of the amino acid 
sequences according to claim 7. 
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9. A polynucleotide sequence which is the complement of one which will hybridise to 
the polynucleotide according to claim 8 under stringent conditions and which 
polynucleotide sequence still encodes a Glutathione S transferase. 

5 10. A polynucleotide according to claim 9 which comprises the sequence depicted as 
SEQ ID No. 11 or 12 or 13 or 14. 

11. A polynucleotide sequence which is the complement of one which binds to SEQ ID 
No. 1 1 or 12 or 13 or 14 at a temperature of between 60°C and 65°C in 0.3 strength 

10 citrate buffered saline containing 0.1% SDS followed by rinsing at the same 

temperature with 0.3 strength citrate buffered saline containing 0.1% SDS wherein 
said polynucleotide sequence still encodes a functional Glutathione S transferase 
enzyme with the proviso that the said polynucleotide sequence or the amino acid 
sequence which it encodes, is not a sequence selected from the group of sequences 

15 listed under accession number; P321 10, U20809, Q03663, P321 1 1, P46421, 

AJ000923, AF004358, AC000348 or AF051238. 

12. A DNA construct comprising in sequence a plant operable promoter operably linked 
to a polynucleotide according to either of claims 5 or 6 operably linked to a 

20 terminator region. 

13. A DNA construct comprising in sequence a plant operable promoter operably linked 
to a polynucleotide or polynucleotide sequence according to any one of claims 8 to 
11, operably linked to a terminator region. 

25 

14. A DNA construct comprising a first region comprising in sequence a plant operable 
promoter operably linked to a polynucleotide according to claim 5 or claim 6 and a 
terminator region and a second region comprising in sequence a plant operable 
promoter operably linked to a polynucleotide or polynucleotide sequence according 

30 to any one of claims 8 to 1 1 and a terminator region. 



PPD50449 



-41 - 



15. A DNA construct according to claim 14 wherein the said first region comprises a 
polynucleotide encoding the amino acid sequence depicted as SEQ ID No. 1 and the 
second region comprises a polynucleotide encoding the amino acid sequence depicted 
as SEQ ID No. 10. " — — — 

16. A DNA construct according to any one of claims 12 to 15 which further comprises a 
third region which provides for the production of a protein which acts as a selectable 
marker. 

17. A DNA construct according to any one of claims 12 to 16 wherein the plant operable 
promoter is selected from the group consisting of: CaMV35S, FMV35S, NOS, OCS 7 
Patatin, E9, alcA/alcR switch, GST switch, RMS switch, oleosin, ribulose 
bisphosphate carboxylase-oxygenase small sub-unit promoters. 

18. A method of providing plants which are resistant and/or tolerant to an agrochemical 
comprising: 

(a) inserting into the genome of plant a polynucleotide or a polynucleotide sequence 
. .according to any one of claims 8 to 1 1 or a DNA construct according to any one of 

claims 13 to 17; and 

(b) regenerating morphologically normal fertile plants or plant parts therefrom; and 

(c) applying to said plants or plant parts an amount of said agrochemical which is 
phytotoxic to control like plants and selecting those plants or plant parts which are 
resistant to said agrochemical. 

19. A method of providing plants which are resistant and/or tolerant to an agrochemical 
comprising: 

(a) inserting into the genome of a plant which plant provides for the production of a 
functional Glutathione S transferase, a polynucleotide according to either of claims 5 
or 6 or a DNA construct according to claim 12; and 

(b) regenerating morphologically normal fertile plants or plant parts therefrom; and 
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(c) applying to said plants or plant parts an amount of said agrochemical which is 
phytotoxic to control like plants and selecting those plants or plant parts which are 
resistant to said agrochemical. 



5 20. A method according to claim 18 or 19 wherein the said agrochemical comprises 
acifluorfen and/or chlorimuron-ethyl and/or fomesafen and/or acetochlor and/or 
metolachlor. 



21 . Plants or plant parts obtained according to the method of any one of claims 1 8 to 20. 

10 

22. Plants or plant parts according to claim 21 wherein said plants or plant parts are 
selected from the group consisting of: melons, mangoes, soybean, cotton, tobacco, 
sugarbeet, oilseed rape, canola, flax, sunflower, potato, tomato, alfalfa, lettuce, maize, 
wheat, sorghum, rye, bananas, barley, oat, turf grass, forage grass, sugar cane, pea, 

15 field bean, rice, pine, poplar, apple, peaches, grape, strawberries, carrot, lettuce, 

cabbage, onion, citrus or nut plants. 

23. Use of a polynucleotide according to any one of claims 5,6, 8 or 10 or a 
polynucleotide sequence according to claim 9 or 1 1 or a DNA construct according to 

20 any one of claims 12 to 1 7 in a method of producing plants which are resistant and/or 

tolerant to a herbicide comprising acifluorfen and/or chlorimuron-ethyl and/or 
fomesafen and/or acetochlor and/or metolachlor. 



24. Herbicide resistant plants obtained by the method of any one of claims 1 8 to 20. 

25 

25. A method of selectively controlling weeds in a field said field- comprising crop plants 
and weeds said method comprising applying to said field an agriculturally acceptable 
formulation of an agrochemical comprising acifluorfen and/or chlorimuron-ethyl 
and/or formasafen and/or acetochlor and/or metolachlor or a functionally related 

30 analogue thereof, characterised in that the said crop plants are the plants according to 

claim 21, 22 or 24. 
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26. Use of an agrochemical comprising acifluorfen and/or chlorimuron-ethyl and/or 

fomesafen and/or acetochlor and/or metolachlor or a functional analogue thereof to 
selectively control weeds in a fi^fiTwfiicIrfield comprises crop plants and weeds 
5 comprising applying to said field an agriculturally acceptable formulation of said 

agrochemical in an amount which is sufficient to be phytotoxic to said weeds but not 
said crop plants characterised in that said crop plants are the plants according to claim 
21,22 or 24. 

10 27. A protein comprising the sequence depicted as SEQ ID No. 7 or a protein variant 
having a Smith- Waterman score greater than 1094 in the SWISSPROT database 
calculated using the FASTA3 algorithm wherein the said protein variant still encodes 
a Glutathione-S-transferase. 

A protein comprising the sequence depicted as SEQ ID No. 8 or a protein variant 
having a Smith- Waterman score greater than 619 in the SWISSPROT database 
calculated using the FASTA3 algorithm wherein the said protein variant still encodes 
a Glutathione-S-transferase. 

A protein comprising the sequence depicted as SEQ ID No. 9 or a protein variant 
having a Smith- Waterman score greater than 671 in the SWISSPROT database 
calculated using the FASTA3 algorithm wherein the said protein variant still encodes 
a Glutathione-S-transferase. 

A protein comprising the sequence depicted as SEQ ID No. 10 or a protein variant 
having a Smith-Waterman score greater than 766 in the SWISSPROT database 
calculated using the FASTA3 algorithm wherein the said protein variant still encodes 
a Glutathione-S-transferase. 




15 28. 



20 29. 



# 
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